Goto

Collaborating Authors

 morse code


China's Plan to Make AI Watermarks Happen

WIRED

These are some of the things the Chinese government wants AI companies and social media platforms to use to properly label AI-generated content and crack down against misinformation. On September 14, China's Cyberspace Administration drafted a new regulation that aims to inform people of whether something is real or AI. As generative AI tools get increasingly advanced, the difficulty to discern whether content is AI-generated is causing all kinds of serious issues, from nonconsensual porn to political disinformation. China's is not the first regime to tackle this issue--the European Union's AI Act, adopted this March, also requires similar labels; California passed a similar bill this month. And China's previous AI regulations also briefly mentioned the need for gen-AI labels. However, this new policy outlines more details of how AI watermarks should be implemented by platforms.


HDFlow: Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows

arXiv.org Artificial Intelligence

Despite recent advancements in large language models (LLMs), their performance on complex reasoning problems requiring multi-step thinking and combining various skills is still limited. To address this, we propose a novel framework HDFlow for complex reasoning with LLMs that combines fast and slow thinking modes in an adaptive manner. Our approach consists of two key components: 1) a new approach for slow, deliberate reasoning called Dynamic Workflow, which automatically decomposes complex problems into more manageable sub-tasks and dynamically designs a workflow to assemble specialized LLM or symbolic reasoning tools to solve sub-tasks; 2) Hybrid Thinking, a general framework that dynamically combines fast and slow thinking based on problem complexity. Finally, we propose an easy-to-scale method for automatically synthesizing a large-scale dataset of 27K challenging reasoning problems for complex reasoning and a hybrid thinking tuning method that trains smaller LLMs on this dataset to internalize the fast/slow hybrid reasoning strategies. Experiments on four reasoning benchmark datasets demonstrate that our slow thinking with dynamic workflows significantly outperforms Chain-of-Thought, and hybrid thinking achieves the highest accuracy while providing an effective balance between computational efficiency and performance. Fine-tuning using our hybrid thinking approach also significantly boosts the complex reasoning capabilities of open-source language models. The results showcase the promise of slow thinking, dynamic workflows, and hybrid thinking in expanding the frontier of complex problem-solving with LLMs\footnote{Code and data will be released at \url{https://github.com/wenlinyao/HDFlow}.}.


Morse Code-Enabled Speech Recognition for Individuals with Visual and Hearing Impairments

arXiv.org Artificial Intelligence

The proposed model aims to develop a speech recognition technology for hearing, speech, or cognitively disabled people. All the available technology in the field of speech recognition doesn't come with an interface for communication for people with hearing, speech, or cognitive disabilities. The proposed model proposes the speech from the user, is transmitted to the speech recognition layer where it is converted into text and then that text is then transmitted to the morse code conversion layer where the morse code of the corresponding speech is given as the output. The accuracy of the model is completely dependent on speech recognition, as the morse code conversion is a process. The model is tested with recorded audio files with different parameters. The proposed model's WER and accuracy are both determined to be 10.18% and 89.82%, respectively.


Machine Learning System Uses Images To Teach Itself Morse Code

#artificialintelligence

Conventional wisdom holds that the best way to learn a new language is immersion: just throw someone into a situation where they have no choice, and they'll learn by context. Militaries use immersion language instruction, as do diplomats and journalists, and apparently computers can now use it to teach themselves Morse code. The blog entry by the delightfully callsigned [Mauri Niininen (AG1LE)] reads like a scientific paper, with good reason: [Mauri] really seems to know a thing or two about machine learning. His method uses curated training data to build a model, namely Morse snippets and their translations, as is the usual approach with such systems. But things take an unexpected turn right from the start, as [Mauri] uses a Tensorflow handwriting recognition implementation to train his model.


When WiFi Won't Work, Let Sound Carry Your Data

WIRED

If you've ever struggled to pair your phone with a Bluetooth speaker or set up a wireless printer, you know that it's often easier to connect to a server halfway around the world than to a gadget across the room. That's a problem as we increasingly use our phones to pay for stuff, unlock doors, and control everything from televisions to thermostats. No one wants to wait for coffee because the cash register can't detect their phone, or shiver in the cold because their watch is trying to connect to their neighbor's door lock instead of their own. Multiple wireless technologies have emerged in recent years to tackle this problem, including Bluetooth, LoRa, and NFC. These technologies are all based on radio frequencies. But a growing number of businesses, from Ticketmaster to Google to nuclear-power plants, are turning to a simpler solution: sound.


Why Morse code is actually a really weird way to communicate

Popular Science

Time is to speech and music recognition as space is to visual object recognition. We can think of recognizing a face in a drawing as a spatial problem--that is, the relevant information is contained in the spatial relationships between all the elements of the drawing. It is also a hierarchical problem: low-level information (lines and curves) must be integrated into a unified image. A circle is a circle, but two side-by-side pairs of concentric circles become eyes; place those in a larger circle and you have a face, and so forth until we have a crowd of people within a scene. Speech and music are the temporal equivalent of recognizing a visual scene: they require solving a hierarchy of embedded temporal problems.


The Easter egg puzzles that are hiding inside video games

New Scientist

Brad Hill knew what the jumbled letters represented, but he had no idea where they would lead. A handful of players had started finding strange things in Trials HD, a game released in 2009 by Finnish studio RedLynx in which you drive stunt bikes over outlandish obstacle courses. The mathematical patterns and cryptic messages discovered in the game's more hard-to-reach places hinted at something beyond high scores. One player uploaded a screenshot of some brass plaques he had found strewn on the ground after crashing his bike through a trapdoor. They were covered in what looked like a coded message and the player wanted to know what it meant.


Pandemonium: A Paradigm for Learning

Classics

G. Selfridge was born in London 10 May PANDEMONIUM: A PARADIGM FOR LEARNING O. G. SELFRIDGE INTRODUCTION WE are proposing here a model of a process which we claim can adaptively improve itself to handle certain pattern recognition problems Which cannot be adequately specified in advance. Such problems are usual when trying' to build a machine to Imitate any one of a very large class of human data processing techniques. A speech typewriter is a good example of something that very many people have been trying unsuccessfully to build for some time. We do not suggest that we have proposed a model which can learn to typewrite from merely hearing speech. Pandemonium does not, however, seem on paper to have the same kinds of inherent restrictions or inflexibility that many previous proposals have had. The basic motif behind our model is the Inn of parallel processing. This is suggested on two grounds: first, it is often easier to handle data in a parallel manner, and, indeed, it is usually the more "natural" manner to handle it in; and, secondly, it is easier to modify an assembly of quasi We are not going to apologize for a frequent use of anthropomorphic or biamorphic terminology.